智能论文笔记

Unpaired cross-modality educed distillation (CMEDL) for medical image segmentation

Jue Jiang , Andreas Rimner , Joseph O. Deasy , Harini Veeraraghavan

分类：计算机视觉

2021-07-16

CT的精确且鲁棒的肺癌分割，即使是靠近纵隔素的CT，也需要更准确地规划和提供放疗和测量治疗反应。因此，我们开发了一种新的跨模型引发蒸馏（CMEDL）方法，使用未配对的CT和MRI扫描，由此信息教师MRI网络引导学生CT网络来提取信号，以提取信号与背景之间的差异。我们的贡献消除了蒸馏方法的两个要求：（i）通过使用图像（i2i）翻译和（ii）通过使用所有网络的并行培训来使用图像的映像（i2i）翻译和（ii）前进的训练。我们的框架使用了端到端培训的未配对I2I翻译，教师和学生分段网络。使用3个分段和2个I2I网络来证明我们框架的架构灵活性。从不同组患者的377ct和82 t2w MRI培训网络，具有独立验证（n = 209肿瘤）和测试（n = 609肿瘤）数据集。网络设计，将MRI与CT信息组合的方法，在信息（MRI至CT）下蒸馏学习，弱（CT至MRI）和平等教师（MRI至MRI）和消融测试。使用骰子相似性（DSC），表面骰子（SDSC）和Hausdorff距离测量精度，并且在95 $ ^ {Th} $百分位数（HD95）。 CMEDL方法显着（P $ <0.001）比具有CT肺肿瘤的信息教师的非CMEDL方法更准确（DSC为0.77与0.73），MRI具有弱大教师（DSC为0.84 vs.0.81） MRI多器官分割，肺肿瘤，等于教师（DSC为0.90与0.88）。 CMEDL还降低了患者间肺肿瘤细分变量。

translated by 谷歌翻译

Foveated Thermal Computational Imaging in the Wild Using All-Silicon Meta-Optics

Vishwanath Saragadam , Zheyi Han , Vivek Boominathan , Luocheng Huang , Shiyu Tan , Johannes E. Fröch , Karl F. Böhringer , Richard G. Baraniuk , Arka Majumdar , Ashok Veeraraghavan

分类：计算机视觉

2022-12-13

Foveated imaging provides a better tradeoff between situational awareness (field of view) and resolution and is critical in long-wavelength infrared regimes because of the size, weight, power, and cost of thermal sensors. We demonstrate computational foveated imaging by exploiting the ability of a meta-optical frontend to discriminate between different polarization states and a computational backend to reconstruct the captured image/video. The frontend is a three-element optic: the first element which we call the "foveal" element is a metalens that focuses s-polarized light at a distance of $f_1$ without affecting the p-polarized light; the second element which we call the "perifoveal" element is another metalens that focuses p-polarized light at a distance of $f_2$ without affecting the s-polarized light. The third element is a freely rotating polarizer that dynamically changes the mixing ratios between the two polarization states. Both the foveal element (focal length = 150mm; diameter = 75mm), and the perifoveal element (focal length = 25mm; diameter = 25mm) were fabricated as polarization-sensitive, all-silicon, meta surfaces resulting in a large-aperture, 1:6 foveal expansion, thermal imaging capability. A computational backend then utilizes a deep image prior to separate the resultant multiplexed image or video into a foveated image consisting of a high-resolution center and a lower-resolution large field of view context. We build a first-of-its-kind prototype system and demonstrate 12 frames per second real-time, thermal, foveated image, and video capture in the wild.

translated by 谷歌翻译

ORCa: Glossy Objects as Radiance Field Cameras

Kushagra Tiwary , Askhat Dave , Nikhil Behari , Tzofi Klinghoffer , Ashok Veeraraghavan , Ramesh Raskar

分类：计算机视觉 | 人工智能

2022-12-08

Reflections on glossy objects contain valuable and hidden information about the surrounding environment. By converting these objects into cameras, we can unlock exciting applications, including imaging beyond the camera's field-of-view and from seemingly impossible vantage points, e.g. from reflections on the human eye. However, this task is challenging because reflections depend jointly on object geometry, material properties, the 3D environment, and the observer viewing direction. Our approach converts glossy objects with unknown geometry into radiance-field cameras to image the world from the object's perspective. Our key insight is to convert the object surface into a virtual sensor that captures cast reflections as a 2D projection of the 5D environment radiance field visible to the object. We show that recovering the environment radiance fields enables depth and radiance estimation from the object to its surroundings in addition to beyond field-of-view novel-view synthesis, i.e. rendering of novel views that are only directly-visible to the glossy object present in the scene, but not the observer. Moreover, using the radiance field we can image around occluders caused by close-by objects in the scene. Our method is trained end-to-end on multi-view images of the object and jointly estimates object geometry, diffuse radiance, and the 5D environment radiance field.

translated by 谷歌翻译

PS$^2$F: Polarized Spiral Point Spread Function for Single-Shot 3D Sensing

Bhargav Ghanekar , Vishwanath Saragadam , Dushyant Mehra , Anna-Karin Gustavsson , Aswin Sankaranarayanan , Ashok Veeraraghavan

分类：计算机视觉

2022-07-03

我们提出了一种依赖工程点扩散功能（PSF）的紧凑型快照单眼估计技术。微观超分辨率成像中使用的传统方法，例如双螺旋PSF（DHPSF），不适合比稀疏的一组点光源更复杂的场景。我们使用cram \'er-rao下限（CRLB）显示，将DHPSF的两个叶分开，从而捕获两个单独的图像导致深度精度的急剧增加。用于生成DHPSF的相掩码的独特属性是，将相掩码分为两个半部分，导致两个裂片的空间分离。我们利用该属性建立一个基于紧凑的极化光学设置，在该设置中，我们将两个正交线性极化器放在DHPSF相位掩码的每一半上，然后使用极化敏感的摄像机捕获所得图像。模拟和实验室原型的结果表明，与包括DHPSF和Tetrapod PSF在内的最新设计相比，我们的技术达到了高达50美元的深度误差，而空间分辨率几乎没有损失。

translated by 谷歌翻译

Improved Text Classification via Test-Time Augmentation

Helen Lu , Divya Shanmugam , Harini Suresh , John Guttag

分类：机器学习 | 自然语言处理

2022-06-27

测试时间增强 - 跨测试输入示例的预测的聚合 - 是一种改善图像分类模型性能的既定技术。重要的是，TTA可用于改善事后模型性能，而无需额外的培训。尽管可以将测试时间增强（TTA）应用于任何数据模式，但它在NLP中的采用有限，部分原因是难以识别标签保护转换。在本文中，我们提出的增强政策可以通过语言模型进行大量准确的改进。一个关键发现是，增强政策设计（例如，从单个，非确定性增强产生的样本数量）对TTA的好处有很大的影响。跨二进制分类任务和数据集进行的实验表明，测试时间的增加可以对当前最新方法进行一致的改进。

translated by 谷歌翻译

MINER: Multiscale Implicit Neural Representations

Vishwanath Saragadam , Jasper Tan , Guha Balakrishnan , Richard G. Baraniuk , Ashok Veeraraghavan

分类：计算机视觉

2022-02-07

我们引入了一种新的神经信号模型，设计用于有效的大型信号的高分辨率表示。我们的多尺度隐式神经表示（矿工）中的关键创新是通过拉普拉斯金字塔的内部表示，它提供了信号的稀疏多尺度分解，可捕获跨尺度的信号的正交部分。我们通过用小型MLP在每个尺度上代表金字塔的小差异斑块来利用拉普拉斯金字塔的优势。这使网络能够适应从粗尺度到细尺度的能力增加，仅代表具有强信号能量的信号的一部分。每个MLP的参数是从粗到细节优化的，从而在更粗糙的尺度下更快地近似，从而最终是一个非常快速的训练过程。我们将矿工应用于一系列大规模信号表示任务，包括吉吉像素图像和非常大的点云，并证明它需要少于参数的25％，33％的内存足迹和10％的计算时间和10％竞争技术（例如橡子）达到相同的表示准确性。

translated by 谷歌翻译

A Framework for Understanding Sources of Harm throughout the Machine Learning Life Cycle

Harini Suresh , John V. Guttag

分类：机器学习 | (统计)机器学习

2019-01-28

由于机器学习（ML）越来越影响人们和社会，因此还增加了对其潜在的不良后果的认识。为了预测，预防和减轻不良的下游后果，我们何时何地以及如何在整个ML生命周期中引入何时以及如何危害。在本文中，我们提供了一个框架，它在机器学习，跨越数据收集，开发和部署中识别七个不同的下游危害来源。在这样做时，我们的目标是促进周围这些问题的更高效和精确的沟通，以及更加直接的应用程序接地方式来减轻它们。

translated by 谷歌翻译